Temporal difference learning

Results: 95



#Item
11 Evolutionary Feature Evaluation for Online Reinforcement Learning

Source URL: eldar.mathstat.uoguelph.ca

Language: English - Date: 2016-07-12 12:05:04
12 Continuous Deep Q-Learning with Model-based Acceleration. arXiv:1603.00748v1 [cs.LG], 2 Mar 2016. Shixiang Gu (sg717@cam.ac.uk)

Source URL: arxiv.org

Language: English - Date: 2016-03-02 20:31:58
13 AITF ANNUAL REPORT 2016 DR. RICHARD SUTTON REINFORCEMENT LEARNING AND ARTIFICIAL INTELLIGENCE AITF ANNUAL REPORT MARCH 31, EXECUTIVE SUMMARY

Source URL: webdocs.cs.ualberta.ca

Language: English - Date: 2016-05-16 19:35:21
14 Fourteen declarative principles of experience-oriented intelligence. 1. All goals and purposes can be well thought of as the maximization of the expected value of the cumulative sum of a single externally received number

Source URL: webdocs.cs.ualberta.ca

Language: English - Date: 2009-03-27 16:18:08
15 GQ(λ): A general gradient algorithm for temporal-difference prediction learning with eligibility traces. Hamid Reza Maei and Richard S. Sutton, Reinforcement Learning and Artificial Intelligence Laboratory, University of

Source URL: webdocs.cs.ualberta.ca

Language: English - Date: 2010-01-22 02:08:08
16 Sutton, Richard PIN

Source URL: webdocs.cs.ualberta.ca

Language: English - Date: 2013-10-18 16:05:54
17 Natural Temporal Difference Learning

Source URL: psthomas.com

Language: English - Date: 2014-11-06 09:18:20
18 An Emphatic Approach to the Problem of Off-policy Temporal-Difference Learning. arXiv:1503.04269v1 [cs.LG], 14 Mar. Richard S. Sutton

Source URL: arxiv.org

Language: English - Date: 2015-03-16 20:16:49
19 Playing Atari with Deep Reinforcement Learning. Volodymyr Mnih, Koray Kavukcuoglu

Source URL: arxiv.org

Language: English - Date: 2013-12-19 20:23:45
20 Value Learning and Arousal in the Extinction of Probabilistic Rewards: The Role of Dopamine in a Modified Temporal Difference Model. Minryung R. Song, Jean-Marc Fellous. Department of Bio and Brain Engineering, Ko

Source URL: amygdala.psychdept.arizona.edu

Language: English - Date: 2014-06-10 21:21:48